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Abstract. For any continuous probability measure /i on R we construct an IFS with 
probabilities having \x as its unique measure-attractor. 



1. Introduction 

In 1981 Hutchinson [6] presented a theory of fractals and measures supported on 
fractals based on iterations of functions. 

Let {R. d ; f\,Pi, i — 1, ...,n} be an iterated function system with probabilities (IFSp). 
That is, : lR d — > M. d , i = l,...,n, are functions and Pi are associated non-negative 
numbers with Y^i=iVi = 1- If the rnaps fi : R d — > M. d are contractions, i.e. if there exists 
a constant c < 1 such that \fi(x) — fi(y)\ < c\x — y\, for all x,y G M, d , then there exists 
a unique nonempty compact set A satisfying 

(1) A = Uttfi(A) = { lim f h of t2 ..-o f ik ( x ); /;/,/;,.. E {1, nf}, 

for any x G M d , and a unique probability measure /i, supported on A, satisfying the 
invariance equation 

(2) /*(•) = EftMC/r'O), 

i=l 

see Hutchinson [6]. The set A is sometimes called the set-attractor, and fi the measure- 
attractor of the IFSp. 

The set-attractor A will have a self-repeating "fractal" appearance if all maps fi are 
similitudes, and the sets, fi(A), % = 1, ...,n, do not overlap. This leads to the intuition 
to regard the set-attractor A in ([T]) as being built up by n (in general overlapping and 
heavily distorted) "copies" of itself, and the measure-attractor as a "greyscale colouring" 
of the set-attractor. (Note that the probabilities pt play no role in the definition of A.) 

In general we can not expect to have a unique set-attractor if the IFS-maps are not 
assumed to be contractions or more generally if the limits lim^oo f it o f i% • • ■ o f ik (x) do 
not exist, with the limit being independent of x, for all i^i^... G {1, ...,n} N , but unique 
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measure-attractors exist if the limits 

(3) Z F (i 1 i 2 ...) := lim f h of i2 ---o f ik (x) 

k— >co 

exist (with the limit being independent of x) for almost alli^i^--- £ {1, ...,n} N . (Indeed, 
if the limit in exists a.s. then Z F may be regarded as a random variable, and its 
distribution /x(-) := P(Z F £ •), is then the unique solution to (T5]).) 

The theory of IFSp has a long pre-history within the theory of Markov chains, starting 
already with papers in the 30th by Doblin and others. Let {Xk}%L be the Markov chain 
obtained by random (independent) iterations with the functions, fi, chosen with the 
corresponding probabilities, p^. That is, let {Xk} be defined recursively by 

X k+1 = f Ik+1 (X k ), k>0, 

where {I^kLi is a sequence of independent random variables with P{h — i) — Pi, 
independent of X , where X is some given random variable. (It is well-know that any 
Markov chain {Xk} (with values in M. d ) can be expressed in the form Xk+\ = g(Xk, Yjt+i) 
where g : R d x [0, 1] ->• R d is a measurable function and {Y^}^ is a sequence of 
independent random variables uniformly distributed on the unit interval, see e.g. Kifer 

00 

If an IFSp has a unique measure-attractor, //, then \x is the unique stationary distri- 
bution of {Xk}, i.e. \i is the unique probability measure with the property that if X is 
/^-distributed, then {Xk} will be a (strictly) stationary (and ergodic) stochastic process, 
see e.g. Elton [5]. Therefore a unique measure-attractor can alternatively also be called 
a unique stationary distribution. 

Under standard average contraction conditions it follows that ([3D holds a.s., and the 
distribution of X k converges weakly to fi (with exponential rate quantified e.g. by the 
Prokhorov metric for arbitrary distributions of the initial random variable X ). Moreover 
the empirical distribution along trajectories of {Xk} converges weakly to \i a.s., and {Xk} 
obeys a central limit theorem. See e.g. Barnsley et al. [3], Diaconis and Freedman [1], 
and Stenflo [9] for details and further results. These papers also contains surveys of the 
literature. 

1.1. The inverse problem. The inverse problem is to, given a probability measure fi, 
find an IFSp having fi as its unique measure-attractor. This problem is of importance 
in e.g. image coding where the image, represented by a probability measure, can be 
encoded by the parameters in a corresponding IFSp in the affirmative cases, see e.g. 
Barnsley [1J. For an encoding to be practically useful it needs to involve few parameters 
and the distribution of X k needs to converge quickly to equilibrium (a property ensured 
by average contractivity properties of the functions in the IFSp) for arbitrary initial 
distributions of X . 

It is possible to construct solutions to the inverse problem in some very particular 
cases using Barnsley's "collage theorem", see [1] containing exciting examples of e.g. 
ferns and clouds (interpreted as probability measures on IR 2 ) and their IFSp encodings, 
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but typically it is very hard to even find approximate solutions to the inverse problem 
for general probability measures on R d . 

In this paper we present a (strikingly simple) solution to the inverse problem for 
continuous probability measures on R. 

2. Main result 

In order to present our solution to the inverse problem for continuous probability 
measures on R, recall the following basic facts used in the theory of random number 
generation; 

Let \x be a probability measure on R, and let F(x) = //((— oo, x]) denote its distribution 
function. The generalised inverse distribution function is defined by 

F -1 (V) = M{F{x) > u}, 0<u<l, 

and satisfies F~ 1 (F(x)) < x and F(F~ 1 (u)) > u and therefore 

F~ l {u) < x if and only if u < F(x). 

From this it follows that if U G U (0, 1), i.e. if U is a random variable uniformly distributed 
on the unit interval, then F~ l {U) is a /i-distributed random variable. This basic property 
reduces the problem of simulating from an arbitrary distribution on R, to the problem 
of simulating uniform random numbers on the unit interval. 

We say that /i is continuous if F is continuous. Note that fi({x}) = for any x G R 
for continuous probability measures in contrast with discrete probability measures where 
Sxes = 1 f° r some countable set S. 

If pi is continuous then F(F~ 1 (u)) = u, for < u < 1. This property is crucial for the 
following theorem; 

Theorem 1. A continuous distribution, /i, on R with distribution function, F, is the 
measure- attractor of the IFS with monotone maps fi(x) := F" 1 o « s o F(x), for any x 
with F(x) > 0, and probabilities pi = 1/n, where Ui(u) = u/n + (i — l)/n, < u < 1, 
i — 1, 2, n, for any n > 2. 

Proof. The Markov chain generated by Ui(x) = x/n + (i — l)/n, % = 1,2, ...,n, chosen 
with equal probabilities has the uniform distribution on the unit interval as its unique 
stationary distribution. That is, if {h}k>i is a sequence of independent random variables, 
uniformly distributed on {1,2, ...,n}, then 

(4) Z%(x)=u Ik o---o Ull (x), Z^(x)=x 

is a Markov chain starting at x G [0, 1] having the uniform distribution as its unique sta- 
tionary distribution. This can be seen by observing that Z%(x) has the same distribution 
as the reversed iterates 

(5) Z%(x)=u h o---ou Ik (x), Z^{x)=x, 
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for any fixed k, and the reversed iterates Z^(x) converges almost surely to the U(0, 1)- 
distributed random variable, Z u , where the k : th digit in the base n expansion of Z u is 
given by 4-1. 

If Z F denotes the limit of the reversed iterates of the system with /j chosen with 
probability 1/n, then 



where the last equality holds since F~ 1 (x) is non- decreasing, and since a monotone 
function can have at most a countable set of discontinuity points in its domain, it follows 
that is continuous for a. a. x G [0, 1] w.r.t. to the Lebesgue measure. 

From the above it follows that 



Remark 1. If X is a continuous \i- distributed random variable, then = u, 

so F(X) G U(0, 1). This contrasts the case when X is discrete where F(X) will also be 
discrete, so we cannot expect Theorem{J\to generalise to discrete distributions. 

If an IFS {R,fi,pi,i = l,...,n}, has a continuous measure- attractor \x being the dis- 
tribution of the a.s. limit of the reversed iterates, and the distribution function F of \x 
satisfies = x, for any x G JR., with < F(x) < 1, then, similarly, the IFS 

{[0, 1], Ui,Pi, i = l,...,n}, with Ui{u) := F o f t o < u < 1, has the U (0,1)- 

distribution as its unique stationary distribution. This is the case for absolutely contin- 
uous probability distributions fi if F is strictly increasing. 

Remark 2. From Theorem^ it follows that any continuous probability distribution on R 
can be approximated by the empirical distribution of a Markov chain {X^.} on K generated 
by an IFSp with trivial "randomness" generated by e.g. by a coin or a dice. 

Remark 3. Theorem^ may be used to represent a continuous probability measure fi on 
R by the functions suggested in the theorem. Note that there exist many iterated function 
systems with probabilities generating the same Markov chain, see e.g. Stenflo [8], so in 
particular it follows than an IFSp representation of a continuous probability measure 
on R is not unique. The given IFSp representation suggested by Theorem^ (for a given 
n > 2) is good in the sense that the generated Markov chain converges quickly to the given 
equilibrium making it possible to quickly simulate it. If the suggested IFSp representation 
cannot be described in terms of few parameters then it might make sense to consider an 
approximate representation by approximating the IFS functions with functions described 
by few parameters e.g. by using Taylor expansions. 




lim Z£(x) := lim f h o ■ ■ ■ o f h (x) 



(6) 



P(Z F <y) = P(F~\Z U ) <y) = P(Z U < F(y)) = F(y). 



□ 
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Remark 4. From TheoremUl it follows that if fi is a continuous probability measure on 
R being the measure- attractor of {IR; fi,Pi, i — 1, ■■■■,n}, with pi ^ 1/n for some n, then 
there exists another IFSp with uniform probabilities having fi as its measure- attractor . 

Example 1. Suppose F is a distribution function satisfying 

F(l-x) = l-F(x), 

and 

F(x)/2 = F(ax + b), for all < x < 1, 

where < b < 1/2, < a + b < 1/2, anda^O. 
Then 

F(x)/2 + 1/2 = 1 - F(l - x)/2 = 1 - F(a(l - x) + b) = F(ax + 1 - a - b). 

Thus random iterations with the maps f\(x) = ax + b, and f'2{x) = ax + 1 — a — b 
chosen with equal probabilities generates a Markov chain with stationary distribution fi 
having distribution function F. 

The case a = 1/3 and 6 = corresponds to F being the distribution function of the 
uniform probability measure on the middle-third Cantor set (the Devil's staircase). 



0.7 
0.-3 




The Cantor set is the set-attractor of the IFSp 
{R;h(x) = x/3,f 2 (x) = x/3 + 2/3, Pl = 1/2, p 2 = 1/2} and 
the distribution function of its measure-attractor (the uniform 
distribution on the Cantor set) is an increasing continuous function 
with zero derivative almost everywhere, with -F(O) = and 
F(l) = 1 popularly known as the "Devil's staircase". 

Example 2. Let fi be the probability measure with triangular density function 

.r < x < 1 



d(x) 



2-x 1 < x < 2 ' 



Then fi is the unique stationary distribution of the Markov chain generated by random 
iteration with the functions 

CtI 0<x<1 

f x ( x ) = \ A 
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and 




< x < 1 

1 < x < 2, ' 



chosen uniformly at random. 




Histograms of the first n points in a simulated random trajectory 
of the Markov chain. The empirical distribution along a trajec- 
tory converges weakly to the stationary triangular-distribution with 
probability one. 

Example 3. The distribution function for the exponential distribution with expected 
value fi = A > 0, satisfies F(x) = 1 — e~ Xx , x > 0. A Markov chain generated 
by random iterations with the two maps fi = /f and f 2 = f% defined as in Theorem^ 
has the exponential distribution with expected value fi as its stationary distribution. We 
can construct interesting "new" distributions by altering such Markov chains in various 
ways, e.g. by altering the application of two IFSs corresponding to different parameter 
values. A result of such a construction is shown in the figure. 
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The upper figures are histograms of the first 200000 points in simulations of a trajectory 
of a Markov chain generated by random iterations with the two maps f\ = /{* and f 2 = f 2 
defined as in Theorem [1\ corresponding to the choices [L — 1 in the left hand figure and 
// = 2 in the righthand figure respectively. 

The lower figures are histograms corresponding to trajectories of Markov chains formed 
by random iterations with the maps gi(x) = fi(fl{x)), g2(x) = fi{.f2(. x ))> 93( x ) = 
fl(ft(x)), 9a{x) = fliflix)) and h x {x) = fl(f?(x)), h 2 {x) = ft(fi(x)), h 3 (x) = 
f 2 (fi(x)), h^x) = f 2 (f 2 (x)) respectively, where in both cases the functions are cho- 
sen uniformly at random. 

Remark 5. The distributions constructed in the lower figures in the example above are 
1— variable mixtures of the exponential distributions with expected values \l = \, and 
fi = 2 respectively. We can, more generally, for any integer V > 1, generate V— variable 
mixtures between continuous distributions. See Barnsley et al. j2] and [3] for more on 
the theory of V— variable sets and measures. 
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